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DISCONTINUOUS COEFFICIENTS 
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Abstract. Convergence of stochastic processes with jumps to diffusion 
processes is investigated in the case when the limit process has discon- 
tinuous coefficients. An example is given in which the diffusion approxi- 
mation of a queueing model yields a diffusion process with discontinuous 
diffusion and drift coefficients. 



1. Introduction 

Suppose that we are given a sequence of semimartingales (x") t >o, n = 
1,2, with paths in the Skorokhod space V = V([0,oo),M. d ) of Revalued 
right-continuous functions on [0, oo) having left limits on (0, oo). If one can 
prove that the sequence of distributions Q n of x n on T> weakly converges 
to the distribution Q of a diffusion process (xt)t>o, then one says that the 
sequence of (xf)t>o admits a diffusion approximation. In this article by 
diffusion processes we mean solutions of ltd equations of the form 

x t = x + / b(s,x s )ds+ / y/a(s,x s ) dw s , 
Jo Jo 

with wt being a vector-valued Wiener process. Usually to investigate the 
question if in a particular situation there is a diffusion approximation one 
uses the general framework of convergence of semimartingales as developed 
for instance in §3, Ch. 8 of (also see the references in this book). 

The problem of diffusion approximation attracted attention of many re- 
searchers who obtained many deep and important results. The reason for 
this is that diffusion approximation is a quite efficient tool in stochastic sys- 



tems theory (see [ 12j , |L3| ) , in asymptotic analysis of queueing models under 
heavy traffic and bottleneck regimes (see H), in finding asymptotically opti- 
mal filters (sec |lq[ , fig] ) , in asymptotical optimization in stochastic control 
problems (see fig] , p9| ), and in many other issues. 

In all above-mentioned references the coefficients a(t, x) and b(t, x) of 
the limit diffusion process are continuous in x. In part, this is dictated by 
the approach developed in §3, Ch. 8 of pCR . On the other hand, there are 
quite a few situations in which the limit process should have discontinuous 
coefficients. One of such situations is presented in P] where a queueing model 
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is considered. It was not possible to apply standard results and the authors 
only conjectured that the diffusion approximation should be a process with 
natural coefficients. Later this conjecture was rigorously proved in 0]. In 
[|l| and H only drift term is discontinuous. Another example of the limit 
diffusion with discontinuous both drift and diffusion coefficients is given in 
article Q on averaging principle for diffusion processes with null-recurrent 
fast component. 

The idea to circumvent the discontinuity of a and b is to try to show 
that the time spent by (t, Xt) in the set G of their discontinuity in x is zero. 
This turns out to be enough if outside of G the "coefficients" of converge 
"uniformly" to the coefficients of xt- By the way, even if all these hold, still 
the functionals 

/ a(t,y t )dt, / b(t,y t )dt, y.eV 
Jo Jo 

need not be continuous on the support of Q. This closes the route of "trivial" 
generalizing the result from §3, Ch. 8 of [20|. 

To estimate the time spent by xt we use an inequality similar to the 
following one 

E [ f(t,x t )dt<N\ [ [ f d+1 (t,x)dxdt) , (1.1) 

JO \Jo JR d J 

which is obtained in Q for nonnegative Borel /. Then upon assuming that 
G C (0, co) x W 1 has d + 1-dimensional Lebesgue measure zero and substi- 
tuting 1q in place of / in fll.l]) we get that indeed the time spent by (t, xt) 
in G is zero. However, for (|1.1|) to hold we need the process x t to be uni- 
formly nondegenerate which may be not convenient in some applications. 
Therefore, in Sec. |H| we prove a version of ( |1 . 1| ) , which allows us to get the 
conclusion about the time spent in G assuming that the process is nonde- 
generate only on G. In essence, our approach to diffusion approximation 
with discontinuous coefficients is close to the one from |TJ. However, details 
are quite different and we get more general results under less restrictive 
assumptions. In particular, we do not impose the linear growth condition. 
Neither do we assume that the second moments of Xq are bounded. The 
weak limits of processes with jumps appear in many other settings, in par- 
ticular, in Markov chain approximations in the theory of controlled diffusion 
processes, where, generally, the coefficients of are not supposed to con- 
verge to anything in any sense and yet the processes converge weakly to a 
process of diffusion type. 



We mention here Theorem 5.3 in Ch. 10 of [14] also bears on this matter 
in the particular case of Markov chain approximations in the theory of con- 
trolled diffusion processes. Clearly, there is no way to specify precisely the 
coefficients of all limit points in the general problem. Still one can obtain 
some nontrivial information and one may wonder if one can get anything 
from general results when we are additionally given that the coefficients do 
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converge on the major part of the space. In Remarks 2.6 and we show 
that this is not the case in what concerns Theorem 5.3 in Ch. 10 of [14]. 

Above we alluded to the "coefficients" of xf. By them we actually mean 
the local drift and the matrix of quadratic variation. We do not use any 
additional structure of x™ . In particular, the quadratic variation is just 
the sum of two terms: one coming from diffusion and another from jumps. 
Therefore unlike [|17|] we do not use any stochastic equations for x™. This 
allows us to neither introduce nor use any assumptions on the martingales 
driving these equations and their (usual) coefficients thus making the pre- 
sentation simpler and more general. On the other hand it is worth noting 
that the methods of (lTj may be more useful in other problems. Our inten- 
tion was not to cover all aspects of diffusion approximation but rather give 
a new method allowing us to treat discontinuous coefficients. In particular, 
we do not discuss uniqueness of solutions to the limit equation. This is a 
separate issue belonging to the theory of diffusion processes and we only 
mention article ||, where the reader can find a discussion of it. 

The paper is organizes as follows. In Section we prove our main results, 



Theorems |2.l| and 2.5 , about diffusion approximation. Their proofs rely on 
the estimate proved in Sec. [H] we have been talking about above. But even if 
the set G is empty, the results which we prove are the first ones of the kind. 



In Theorems and |2.5| there is no assumption about any control of 
y/a(t, x) and b(t,x) as |x| — > oo, but instead we assume that Q n converge 
weakly to Q. Therefore, in Sec. || we give a sufficient condition for precom- 
pactness of a sequence of distributions on Skorokhod space. Interestingly 
enough, this condition is different from those which one gets from [Q] and 
]p0| and again does not involve usual growth conditions. Sec. || contains 
an example of application of our results to a queueing model close to the 
one from (p, [01. We slightly modify the model from ]!], [01 and get the 
diffusion approximation with discontinuous drift and diffusion coefficients. 
To the best of our knowledge this is the first example when the diffusion 
approximation leads to discontinuous diffusion coefficients. 

The authors are sincerely grateful to the referees for many useful sugges- 
tions. 

2. The main results 
We use notions and notation from [20]. For each n = 1, 2, let 



(n n ,j^,j^,t > o,p n ) 

be a stochastic basis satisfying the "usual" assumptions. Let T> be the 
Skorokhod space or right-continuous Revalued functions xt given on [0, oo) 
and having left limits on (0, oo). As usual we endow T> with Skorokhod- 
Lindvall metric in which T> becomes a Polish space (see Theorem 2, §1, 
Ch. 6 of @). 

Suppose that for each n on J7 n we are given an ^""-semimartingale xf, 
t > 0, with trajectories in T>. Let (B n ,C n ,i ,n ) be the triple of predictable 
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characteristics of (x™,^) and [i n be its jump measure (see §1, Ch. 4 of 
lOll). Then 



4 1 = xl + 5 t n + x? c + / / a;0i n -i/ n )(d«dz) + / f xfi n {dsdx), 

JO J\x\<l JO J\x\>l 

where is a predictable process of locally bounded variation with Bq = 0, 
is a continuous local martingale with {x nc )t = C™, v n is the compensator 
of /U n . Define 

m « =x ™ + f f x fan _ v n^ dsdx ^ jn = f f x ^(dsdx) 
Jo J\x\<l Jo J\x\>l 

so that m" is a locally square-integrable martingale and 

x n = x n + B n + m n +j n_ ^ 

Assumption 2.1. (i) For each n on (0, oo) x V we are given an Revalued 
function 6 n = b n (t,y.) and a d x d matrix valued function a" = a n (t,y.) 
which is nonnegative and symmetric for any t and y. £ T>. The functions b n 
and a n are Borel measurable, (ii) For each r S [0, oo) there exists a locally 
integrable function L(r, t) given on [0, oo) such that L(r, t) increases in r 
and 

\b n (t,y.)\ + trace a n (t,y.) < L(r,t) (2.2) 
whenever t > 0, y. £ T>, and |yt| < r. (iii) We have 

B t n = /"* 6 n (s, x n ) (is, (m n ) t = 2 /* a n (s, x n ) ds. 

Remark 2.1. We have 

(mf = (x nc )j j + / / x i x j u n (dsdx) 

Jo J\x\<l 



and it follows from Assumption 2.1 that both summands on the right are 
absolutely continuous in t. In particular, they are continuous, which along 
with the continuity of B™ implies that is quasi leftcontinuous (see Theo- 
rem 1, §1, Ch. 4 of p|). 



Assumption 2.2. (i) On (0, oo) x M. d we are given an Revalued function 
b = b(t, x) and a dxd matrix valued function a = a(t, x) which is nonnegative 
and symmetric for any t and x. The functions b and a are Borel measurable. 

(ii) There exists a Borel set G C (0, oo) x M. d (perhaps empty) such that, 
for almost every t S (0,oo), for every x lying outside of the t-section G t := 
{x G M. d : (t, x) £ G} of G and any sequence y n 6 2?, which converges to a 
continuous function y. satisfying yi = x, it holds that 

&"(i,y n )^&(t,x), a"(t,y")^a(t,x). 
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Remark 2.2. It is easy to see that Assumption implies that for almost 
any t, the functions a(t, x) and b(t, x) are continuous on the set M rf \ Gt in 
the relative topology of this set. 



Also, Assumptions |2.1| and 2.2 obviously imply that 



\b(t, x)\ + trace a(t, x) < L(r, t) 
for almost every t G (0, oo) and all x satisfying \x\ < r, x G" Gt- 

Assumption 2.3. If G 7^ 0, then for almost each t 

(i) the set Gt has Lebesgue measure zero, 

(ii) for every x £ Gt and each sequence y n G V, which converges to a 
continuous function y. satisfying yt = x, we have 

lim^oo det a n {t, y n ) > 5(t, x) > 0, (2.3) 

where 5 is a Borel function. 



Remark 2.3. Condition (|2.3j ) is satisfied if, for instance, the processes are 
uniformly nondegenerate in a neighborhood of Gt- 

Assumption 2.4. For any T, e G (0, 00), and any a G (0, 1], it holds that 
lim P n (v n ({0,T] x B c a )) > e) = 0, 

where B a = {x € R d : \x\ < a}, B c a = {x G R d : \x\ > a}. 
Remark 2.4. Notice that for each a G (0, 1] and r, T G [0, 00) 

■= r i \x\ 3 i lxsl < r v n (d S dx) < [ t [ + r f 

JO J\x\<l JO J\x\<a Jo J\x\>a 

< a f T [ \x\ 2 I\ xA < r v n {dsdx) + v n ({Q,T] x ££)), 

Jo J\x\<l 

where according to Assumption 2.1 the first term on the right is less than 



2a / /|a; s |<r trace a n (s, x n ) ds < 2a / L(r,s)ds. 
Jo Jo 



It follows easily that, owing to Assumptions 2A and 2A, for each e > and 
r, T G [0, 00), we have 

lim P n (9? T > e) = 



and since 9™ T < 2 J* Q T L(r, s) ds, we also have E n 8™ T — > as n — > 00, where 
E n is the expectation sign relative to P n . 
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Remark 2.5. Define 

7 n = M{t > : |j t n | > 1}. (2.4) 
Then j n is an ^""-stopping time, and obviously j™ = for < t < 7™. 



Furthermore, by Lemma VI. 4. 22 of ||], Assumption |2.4| implies that 

P n ( 7 n < T) -> 

for each T G [0, 00). 



Theorem 2.1. in addition to Assumptions 2.1-2.4, suppose that the se- 
quence of distributions (Q n ) n >i of x n converges weakly on the Polish space 
T> to a measure Q. Then Q is the distribution of a solution of the ltd equa- 
tion 

xt = xo + / \j 2a(s, x s ) dw s + / b(s,x s )ds (2-5) 
Jo Jo 

defined on a probability space with wt being a d-dimensional Wiener process. 
Remark 2.6. Notice that there are no conditions on the values of a(t, x) 



and b(t,x) on the set G. Hence Theorem 2.1 holds if we replace a, b with 
any other Borel functions, which coincide with the original ones on the 
complement T of G. Of course, this can only happen if 

Ig(s, x s ) ds = (a.s.). 



This equality is proved in Lemma |2.4| . In particular, xt satisfies 

x t = x + / I r (s,x s )y / 2a(s,x s ) dw s + / I r {s,x s )b(s,x s ) ds. (2.6) 
J Jo 



Thus, the limit process satisfies fl2.6|) . A particular feature of this equa- 
tion is that generally its solutions are not unique. Indeed, let x' t be a one- 
dimensional Wiener process wt and x'( the process identically equal to zero. 
They both satisfy dx t = ^2a(t, x t ) dw t , where a(t, x) = 1/2 for (t,x) G, 
a(t,x) = for (t, x) £ G, and G = [0,oo) x {0}. Of course, there are 
many more different solutions which spend some time at zero then follow 
the trajectories of wt for a while and then again stay at zero. Therefore, the 
statement that xt has the form 

xt = xq + / y/2a~ s dw s + / b s ds, 
Jo Jo 

where a s = a(s,x s ) and b s = b(s,x s ) whenever (s,x s ) G and a and b are 



not specified otherwise (cf. the first part of Theorem 5.3 in Ch. 10 of [14]), 
contains very little information on the process: in the above example both 
x' t and x'l have this form. In contrast with this always in the above example, 
the fact that without changing xt one can change o, b on G in any way, and 
thus take a = 1/2, leaves only one possibility: xt = wt- 
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Remark 2.7. From Remark [2.6| we also see that the assumption that ( |2.5| ) 
has a unique (weak or strong) solution makes no sense unless the values 
of a(t,x) and b(t,x) are specified everywhere. In Theorem 5.3 in Ch. 10 of 
fL4| an attempt is presented to specify a(t, x) and b(t, x) on G consisting of 
requiring that they belong to the set of all possible diffusion and drift coef- 
ficients of x t when x t G Gt- Generally, the set xt £ Gt has zero probability 
(say, for the Wiener process) and the requirement seems to have little sense. 
Nevertheless, it is natural to assume that, if x± = wt in the example from 



Remark 2.6, then the only possibility for a(t, 0) is 1/2, the same value as for 



all other x. 

In that case, the equation dxt = y / 2a(t, xt) dwt (= dwt) with zero initial 
condition has a unique solution, the distribution of which (by Theorem |2.l| ) 
is the weak limit of the distributions of solutions to dx™ = y/2a n (xt) dwt with 
zero initial condition, where a n (x) = 1/2 for \x\ > 1/n and a n {x) = 1/3 for 
\x\ < 1/n. 

Hovewer, this fact does not imply that the distributions of any other 
processes z n converge to the Wiener measure, provided only that zf satisfy 
z% = and dz? = ^/2c n (z?) dw t with c n (x) = a n (x) for \x\ > 1/n, c n > 0, 
and sup n x c n (x) < oo. To show this, it suffices to define c n (x) = n 2 x 2 for 
\x\ < 1/n and notice that = for all n. 

This somewhat contradicts the second part of Theorem 5.3 in Ch. 10 of 

0- 



The proof of Theorem I2JJ consists of several steps throughout which we 
assume that the conditions of this theorem are satisfied. 

The idea is to rewrite Q2.5| ) in terms of the martingale problem of Stroock- 
Varadhan. Then naturally we also want to write the information about xf in 
a martingale form not involving stochastic bases and convenient to passing 
to the limit. This is done in Lemma |2.2| . After that we pass to the limit and 
in Lemma |2.3| derive our theorem upon additionally assuming that the time 
spent by the limit process (t,xt) in the set G of possible discontinuities of 
coefficients is zero. This additional assumption holds, for instance, if G = 0. 



Lemma 2.4 concludes the proof of the theorem. 



After that in Theorem 12.51 we extend Theorem 2.1 to cases in which uni 



form nondegeneracy on Gt of diffusion is not required. We show the useful- 



ness of Theorem 12.51 in Remark 4.3 



As any probability measure on 2?, the measure Q is the distribution on T> 
of a process x. having trajectories in D and defined on a probability space. 
By E we denote the expectation sign associated with that probability space. 
We will see that the theorem holds for this x. up to a possible enlargement of 
the probability space on which x. lives. In the following lemma Assumptions 
2] and |2^ are not used. 

By C^°(M. d+1 ) we denote the set of all infinitely differentiable real- valued 
function u = u(t,x) on R d+1 with compact support. 



s 



N. V. KRYLOV AND R. LIPTSER 



Lemma 2.2. For any < ii < ... < t q < s < t < oo, continuous bounded 
function f on R qd , and u £ C^(R d+1 ), we have 

Ef(x tl ,...,x tq )[u(t,x t ) - u(s,x s )] 

= \im n ^ 00 E n f(xl,...,xl)J t s [u p (p,x;) +a n ^(p,x n )u x ^(p,x;) 

+b ni (p,x n )u xi (p,xZ))di2.7) 

Furthermore, the integrand with respect to p is less than NL(r,p), where the 
constants N and r depend only on u but not on uj and n. 

Proof. Denote 



n _ -n 
x t Jt ' 



and for any process zt on £l n denote (whenever it makes sense 
M?(z.) := u(t, zt) - u(0,z ) - / u t (s,z s )ds- u x i(s,z s )dB 



ni 
s 



-(1/2) / u xixj (s,z s )d(m n y>, (2.8) 
J o 

Ps(z.,x) = u(s,z s + x) - u(s,z s ) - x l u x i(s,z s ) - (l/2)x l x j u x i xj (s,z s ), 

H£(z.)= I [ P n s {z.,x)v n (dsdx). (2.9) 

JO J\x\<\ 

Notice that, by Ito's formula (see Theorem 1, §3, Ch. 2 of p0| ) the process 
M™(z n ) — R™(z n ) is a local ^""-martingale. To be more precise Theorem 1, 
§3, Ch. 2 of goj says that 

M?(z?) - Rt(z n ) = H s > z s) ~ u ( s , <-) ~ uA*> 

0<s<t 

— / [u(s, Zg + x) - u(s, Zg) - x l u xl (s, z™)] v n (dsdx) 

J0 J\x\<l 

ft 

Here the last term is a local martingale as is any stochastic integral with 
respect to a local martingale and the sum of remaining terms equals 

/ [u(s, z™_ + x) — u(s, Zg_) — x t u x i(s, Zg_)] j2(dsdx) 

J\x\<l 

which is the stochastic integral with respect to the martingale measure p, = 
p — v and thus also is a local martingale. 

Take the ^-""-stopping time 7™ introduced in (|2.4j ). Then 

M t % n (z n ) - R? Ajn (z n ) 

is again a local martingale. It turns out that, for each T £ [0, 00), the 
trajectories of Mp Al n(z n ), t G [0, T], are bounded and even uniformly in n, 



f u xl {s,x n s _)dmf. 
Jo 
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Indeed, let r be such that u(t,x) = for \x\ > r. Notice that z™ = x™ for 
< t < 7 n . Then we find 

rtAf n rtA-y n 

/ u x i(s,z?)dB?= / u x ,(s,x^b m (s,x n )ds, 
Jo Jo 

where 

\u xi (s,x™)b ni (s,x n )\=0 
if \x s \ >r (since u(t,x) = for \x\ > r) and 

\u x i(s,x™)b m (s,x n )\ < L(r,s) sup \u x (s,x)\ 

s,x 

if \%s\ < t (see Assumption 2.1). Therefore, 

tA7 n ft 

u x i(s, Zg) dB™ < sup \u x (s, x)\ / L(r,s)ds. 



Similarly one treats the integrals with respect to (ra n )s J . As long as R^(z n ) 
is concerned we notice that, for |x| < 1 and < t < r y n , we have 

\p n s {z n ,x)\ < AHx|%n|< r + 1 = Nlxfl^n^!, 

where the constant N can be expressed in terms of the third-order derivatives 
of u only. Therefore, 

\R?A T (z n )\<N9? +1>T , 
where 6™ T is introduced in Remark 2/4. By this remark for any t we have 
E\B% M n{z n )\ -> 0. It follows that E n \R? Ayn (z n )\ < oo, so that the local 
martingale M^ n {z n ) — i?™ A7 n(z n ) is in fact a martingale. 
Hence, 

E n f(xl, ...,x?J[M^ n (z n ) - R^(z n ) ~ (M^ n (z n ) - R^(z n ))] = 0. 
Since E n \Rf Al „(z n )\ — > 0, we also have 

n lim ^/(xJi, ...,xr 9 )[M t V(z") - M s V(z™)] = 0. 

Furthermore, due to Remark P(^f n < T) — > as n — ► oo for each 
T £ [0, oo). In light of this fact and by virtue of the uniform boundedness 
of M" 7 n(z n ), we obtain 

lim £ n |M™ (z n ) - M™ (z n )\L ln<t = 0, (2.10) 

so that 

lim E"/(^,...,^)[M^ n )-M s "(^)]/ t<7 » =0. 

n— >oo * L 

In addition, obviously, M[ l (z n ) = M^{x n ) for t < "f n and in the same way as 
above one can prove that the trajectories of M t n (x n ), t € [0, T], are uniformly 
bounded in n for each T. It follows that pTQ| ) holds with i,s, x n in place 
of t A 7 n , s A 7 n , z n , respectively. Thus, 

lim E n f{xl : ...,xZ)[M?{x n ) - M2(x n )] = 
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which is rewritten as (2.7). The asserted boundedness of the integrand in 
( |2.7D follows easily from the above argument. The lemma is proved. 

After we have exploited stochastic bases (O n , T n , J-^, t > 0, P n ), we will 
pass to processes defined on the same probability space. We are going to 
rely upon two facts. First we know from Theorem 1, §5, Ch. 6 of [^] that, 
owing to Assumption 2.4, Q is concentrated on the space of continuous Re- 
valued functions defined on [0, oo). Second, remember that if y n — > y. in D 
and y. is continuous, then \y n — y.\l — > for any t < oo, where 



sup \y r \. 

r<t 



Owing to these facts and Skorokhod's embedding theorem (see §6, Ch. 1 of 
we may assume that all the processes x n , n = 1,2, are given on the 
same probability space and there is a continuous process Xt such that (a.s.) 

(2.11) 



lim sup|x^ -x t \ = VTG[0,oo). 

n^co t<T 



Lemma 2.3. Assume that for any T 

fT 



E 



I G (t,x t )dt = 0, 



(2.12) 



o 



which is certainly true if G = 0. Then the assertion of Theorem 2.1 holds. 

Proof. As explained before the lemma we can write E in place of E n 
in (2/7). Then we insert I Xp ^G p ^ which is harmless due to ( 2.12 ), in the 
integral in ( [2.7| ) (notice x p and not x"). Furthermore, we remember the last 
assertion of Lemma |2.2| and use Assumption 2J , (|2.11|) , and the dominated 
convergence theorem to conclude that the limit in (|2.7|) equals 



Ef(x tl ,...,x tq ) J I Xp ^G p [a i:l (p,Xp)u x i x i(p,Xp) 

+b l (p,Xp)u x i(p,x p ) +u p (p,x p )] dp. (2.13) 
By using ( [2.12 ) again, we obtain that 

Ef(x h , ...,x tq )[u(t, x t ) - u(s, x s )] 

= Ef(x tl , ...,x tq ) / [up(p,x p ) + a %:i {p,Xp)u xixi {p,Xp) 

J s 

+b l (p,x p )u xl (p,Xp)] dp, 

for any bounded continuous / and U < s < t. The latter just amounts to 
saying that the process 



u{t,x t ) 



[u s (s,x s ) + a l - J (s,x s )n a , I;r i(s,a; s ) + tf(s, x s )u x i(s, x s )] ds 



is an .Ff-martingale, where J-f is the <T-field generated by s s , s < t. It 
only remains to remember the Levy-Doob-Stroock-Varadhan characteriza- 
tion theorem (see, for instance, Sec. 4.5 in [22] or Sees. 2.6 and 2.7 in ||). 
The lemma is proved. 
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Remark 2.8. In the general case the above proof and Fatou's theorem show 
that, if / is nonnegative, then 

Ef(x tl ,...,x tq )[u(t,x t ) - u(s,x s )] 
< Ef{x tl , ...,x tq ) I I Xp ^G p [up{p,x p ) + aV(p,x p )u x i xj (p,x p ) 

J s 

, Xp)ll x i (p,x p )]dp + I, (2.14) 



where 



I = Ef(x tl ,...,x t ) 



1 x v aG v hm n ^oo [a mj (p, x n )u x i xj (p, x p ) 
+b ni (p,x n )u xi (p,x%) + u p (p, x%)] dp. 



(2.15) 



In the following lemma we finish proving Theorem |2.l| . At this moment 
we take Theorem [O for granted. 



Lemma 2.4. Equation ( 2.1Z. ) holds and hence, by Lemma 2.S , Theorem 2A 
holds true as well. 



Proof. First, we estimate the lim in ( 2.15 ). Fix u and almost any p 
for which ( |2.3D holds with p in place of t and x p (oS) G G p . Then we can 
replace lim n _^oo with lim , where re' is an appropriate sequence tending to 

n'— >oo 

infinity. By extracting further subsequences when necessary we may assume 
that a n (p, x n ) and b n (p, x n ) converge to some a and b. Since x p G G p and 



0, (|2.3|) implies that deta > 5(p,x p ). In addition, 
|6| + trace a < L(\x p \ + l,p) 



due to Assumption 2A. Combined with deta > 5(p,x p ) this yields 

o^'A^ > 5{p,x p )L^ d - l \\x p \ + l,p)|A| 2 =: 5(p,x p )\X\ 2 > 5(p,x p )\X\ 2 

for all A G W d , where 5 = IgS. Now by replacing 5 with 5 and both K (r, t) 
and L(r, t) with L(r + 1, t) in Sec. pi we conclude that 

Y^ n ^ 00 [a nij {p,x n )u xlxJ {p,x^) + b n \p,x n )u^{p, a£) + Up(p,a£)] 

< u p (p,x p ) + F(p,x p ,u xx (p,x p )) +L(\x p \ + l,p)|«x(P)»p)|- 

Furthermore, Remark s hows that the same estimate holds for the 
expression in brackets in ( |2.13| ), so that according to ( |2.14 ) 

Ef(x tl ,...,x tq )[u(t,xt)-u(s,x s )] < Ef(x tl ,...,x tq ) / [up(p,x p ) 

J s 

+F(p,x p ,u xx (p,x p )) + L(\x p \ + l,p)\u x (p,x p )] dp 
if / > 0. Hence the process 

u(t,x t )- / [u s (s,x s ) + F(s,x s ,u xx (s,x s )) + L(\x s \ + l,s)\u x (s, x s )] ds 
Jo 
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is a super martingale and by Theorem 5.1 estimate (5.2) holds. If we take 



there / = Iq and remember that the Lebesgue measure of G is zero and 
5(t, x) > on Gt for almost all t, then we come t o (|2.12j ) with T A r r in 
place of T. Upon letting r — > oo we finally obtain ( |2.12j ) as is. The lemma 
is proved. 



The following theorem is used in Remark 4.3. Its proof is obtained by 



changing variables. We introduce an assumption different from Assump- 



tion 2.3 



Assumption 2.5. If G ^ 0, then G = \J^ =1 G m , where G m are Borel 
sets. For each m, we are given an integer d m > 1, a nonnegative Borel 
function 5 m defined on (0, oo) x M rfm , and a continuous M dm -valued function 
v m (t, x) = (v ml (t, x), v mdm (t, x)) defined on [0, oo) x R d and having there 
continuous in (t,x) derivatives v™ 1 , v ™ l , v™ . For each m and almost every 
t £ (0,oo), 

(i) the set f m (t,G^) has d m -dimensional Lebesgue measure zero, 

(ii) for every x S v m (t, Gf 1 ) and each sequence y n £ T>, which converges 
to a continuous function y. satisfying v m (t,yt) = x, we have 

Km^oo det V mn (t, y n ) > 5 m (t, x) > 0, (2.16) 

where the matrix V mn (t, y.) is defined according to 

V™{t,y.) = v^{t,y t )v^{t,y t )a nkr {t iy .) i,j = l,...,d m . 



Remark 2.9. Assumption |2.3| is stronger than Assumption |2.5| . Indeed, if 
the former is satisfied, one can take G m = G, 5 m (t,x) = 5(t,x), d m = d, 
and v m% = x\ i = 1, d, in which case det V mn = det a n . 



Remark 2.10. Another case is when again everything is independent of m, 
but d m = 1 and v(t,x) = x 1 . Then condition ( 2.16 ) becomes 

lim™^ 11 ^) >5(t,x) >0, 



which is much weaker than ( |2.3[ ). However, in this case in order to satisfy 
requirement (i) of Assumption 2.5 we need to assume that Gt lies in a 
hyperplane orthogonal to the first coordinate axis. 



Remark 2.11. Assume that G = Um=l wri ere G™ are independent of t 
and are hyperplanes G™ = {x : (x,a m ) = j3 m } with certain a m G M. d and 
(3 m G H satisfying \a m \ = 1. Assume that we have a Borel nonnegative 
functions 5 m (t,x), x G M. Finally, assume that for every m > l,t > 0, 
x G R d such that 

(x, o m ) — 0m: 

and each sequence y n G V, which converges to a continuous function y. 
satisfying yt = x, we have 

fim^ooa^^,^)^^ > 5(t,(3 m ) > 0. 
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Then it turns out that Assumption |2.5| is satisfied. To show this, it 
suffices to take d m = 1 and v m (t,x) = (x,a m ) and notice that the image of 
G™ under the mapping v m (t, •) : — > R is just one point (3 m . We will use 
this fact in Sec. ||. 



Remark 2.12. Generally, condition ( 2.16| ) is aimed at situations in which x 



in the limit may degenerate in some directions but not along all those which 
are transversal to Gt- 



Theorem 2.5. Suppose that Assumptions ^,. \ , pO) , 2.4, and \2. 3^ are satisfied 
and the sequence of distributions (Q n ) n >i of x n converges weakly on D to a 
measure Q. Then the assertion of Theorem \2.\ holds true again. 



Proof. We mimic the argument from the proof of Lemma 



2.4 to show 



that ( 2.12 ) holds if Assumption 2J3 rather than Assumption is satisfied. 



The main idea is to change variables according to the mappings v m . 

It suffices to prove that, for each m, equation ( 2.1 2| ) holds with G m in 



place of G. Furthermore, without losing generality we may assume that each 
set G m is bounded otherwise we could split each of them into the union of 
bounded sets and consider them as new G m, s. We fix m,T, and R and 
assume that G m C [0, T] x Br. Then the behavior of v m (t,x) for large |x| 
becomes irrelevant and, changing v m outside of [0, T] x Br if necessary, we 
assume that 

v m (t,x) = ei \x\ (2.17) 

for (t, x) [0, 2T] x B2R , where e\ is the first basis vector in R rfm . It follows 
that there is a constant Nq < 00 such that 

\v™(t,x)\ + \v™ ixj (t,x)\ + \v?(t,x)\<N Vi,x. (2.18) 

It also follows that, for any r > 0, 

\v™(t,x)\<r=>\x\<2R + r. (2.19) 

After that we go back to Lemma |2.2| and take there 

u(t,x) = w(t,v m (t,x)), 

with w being a function of class C^°(K dm+1 ). By the way, our stipulation 
(|2.17j) about the behavior of v m for large |x| yields that u G Cg°(R dm+1 ). 
We also take the function / in the form 

f(y l7 ...,y q ) = g(v m (t uyi ),...,v m {t q ,y q )), 

where yi G M. d and g is a continuous bounded nonnegative function on 
Finally, we define 

xr = ^ m (t,xD, x t = v m (t,x t ). 

Notice that 

a nij (p, x n )u xlxj (p, x n ) + b ni (p,x n )u xl (p,x 1 l) + u p (p, <) 
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= a nkr (p, x n )w x k x r (p, 3%) + b nk {p, x n )w xi (p, x n v ) + w p (p, 5%), 
where, for y. G T>, 

a nkr (p,y.) = a n v(p,y.)v™ k (p,y p )vj(p,y p ), 

b nk (p, y.) = a n ^(p, y.)v% (p, y p ) + b ni (p, y.)v™\p, y p ) + v™ k (p, y p ). 
Then on the basis of Fatou's theorem and Lemma |2.2| we get 

Ef(x tl , ...,x tq )[u(t, x t ) - u(s, x s )] 



-- Ef(x tl ,...,x tq ) / lim n ^[u p (p, x p ) + a™ J (p,x n )u x i xj (p,x p ) 

J s 

+b ni (p, x n )u xl (p,x^)]dp 
Eg(x tl , x tq ) J lim n ^oo [w p (p, x p ) + a nkr (p, x n )w x k x r (p, x™) 
+b nk (p,x n )w xk (p,x p 1 )] dp. 



Also notice that owing to (|2.1§| ), a and b satisfy ( |2.2| ) with L(r, t) replaced 
with NoL(r,t). In light of ( |2. 19|) this implies 

\b n (t,x n )\ + trace d n {t,x n ) < N L(2R + \3$\,t). 



In addition, according to ( 2.1q ), for almost any t, for every x G f m (i, GJ") 
and each sequence y n G 2?, which converges to a continuous function y. 
satisfying v m (t, yt) = x, we have 

lim t),— >or> 

deta n (i,y n ) > <S TO (t,x) > 0, 

Km n _> 0O a n * p (t,y: i )A*A r > <5 m (t,S)|A| 2 
for all A G i? dm , where 

5 m (t,z) = 5 m (t,x)L^ (dm " 1) (2,R+ |x| + l,t)I vm{Gm) (t,x) 

Then as in the proof of Lemma |2.4| we find that 

Eg(x tl , ...,x tq )[w(t,x t ) - w(s,x s )] < Eg(x tl , x tq ) J [w p (p,x p ) 

+F(p, x p ,w xx (p,x p )) +L{2R + \x p \ + l,p)\w x (p,x p )] dp, 

where the operator F is constructed on the basis of 5 m and NoL(2R + r, t) in 
place of 5 and both L, K from Sec. || respectively, on the space of functions 
on W dm in place of M. d . Again as in the proof of Lemma 2A we conclude 
that, for any S we have 

E / I v m(Qm)(t, xt) dt = 0. 
Jo 

Since, obviously, I(jm{t,x) < I v m(Qm^ (t,v m (t,x)) we get that ( |2~ip holds 
with G m in place of C As we have pointed out in the beginning of the 
proof, this is exactly what we need. The theorem is proved. 
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3. A SUFFICIENT CONDITION FOR PRECOMPACTNESS 



One of the conditions of Theorem 2.1 is that the sequence of distributions 



(Q n )n>i of x n on T> converge. One can always extract a convergent sub- 
sequence from a sequence which is precompact and here we want to give a 
simple sufficient condition for precompactness to hold. The assumptions of 
this section are somewhat different from the ones of Sec. [2] and this was the 
reason to treat the issue in a separate section. We take the objects intro- 
duced in Sec. |2] before Assumption 2A and instead of that assumption we 
require the following. 



Assumption 3.1. Assumption 2.1 is satisfied with condition (ii) replaced 
by the following weaker condition: For each r G [0, oo) there exists a locally 
integrable function L(r, t) given on [0, oo) such that L(r, t) increases in r 
and 

\b n (t,y.)\ + trace a n (t,y.) < L(r,t) 
whenever t > 0, y. G V, and sup s<t \y s \ < r. 



Lemma 3.1. Under Assumptions and \3. ][ suppose that we are given 



stopping times r™, n = 1,2, r > 0, and a finite function a(r) defined on 
(0, oo) such that we have (i) for all n and r, 

K\ < a(r) if 0<t< r r n , (3.1) 

and (ii) 

lim lim^ooP^T™ < T) = VT G [0, oo). (3.2) 

r— >oo 

Then the sequence (Q n ) n >i is precompact. 
Proof. Define 

G" = f [\b n (s,x n )\ + trace a n {s,x n )} ds, 
Jo 

F t n = G?+ If v n {dsdx) 

JO J\x\>l 



Owing to Assumption 2.4, by Theorem VI. 4. 18 and Remark VI. 4. 20 of Q to 
prove the theorem it suffices to check that the sequence of distributions on 
D of F n is C-tight, that is precompact and each limit point of this sequence 
is the distribution of a continuous process. In turn, due to Theorem VI. 4. 5 
and Remark VI. 4. 6 (3) of [0], to prove the C-tightness it suffices to show 
that, for any T G [0, oo) and e > 0, 

limjv^oo ESI n Wsup|F™| >N)=0, 

v t<T ' 



lim 5io lim n P n [ sup \F t n +s - F t n \ > e )= 0. (3.3) 

v t+s<T,0<s<6 
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In view of Assumption 2.4 we need only prove (3.3) for G n in place of F n . 
We do this replacement and after that notice that, for any r, the left-hand 
side of the first equation in ( |3.3[ ) is less than 



lim lim n P n ( sup \G?\>N)+ lim n^P 11 (r r n < T) . 

Here the first term is zero for each r since Gf is continuous in t and 

|G?| < ! L{a(r),u)du 
Jo 

for t < t™ when by our assumptions < r. In addition, the second term 
can be made as small as we wish by choosing a sufficiently large r. This 
proves the first equation in (|3.3|) . 

Similarly, the left-hand side of the second equation in (|3.3| ) with G n in 
place of F n is less than 

limImT n W sup \G? +S - Gf\ > e) + lh^ n ^ 00 P n (T? < T), 

S l° \t+s<TAT?,0<s<6 ' 

where again the first term vanishes since 

\G? +S -G?\< f' +S L(a(r),u)du. 
The lemma is proved. 

Remark 3.1. It may be worth noticing that the combination of assumptions 



(i) and (ii) of Lemma 3^ is equivalent to the following: for any T G (0, oo) 



the sequence of distributions of sup t<r is tight or put otherwise 
lim lim n _ +oc P n (sup \x?\ > r) = 0. 



Lemma 3.1 reduces the investigation of precompactness to estimating 
|x n |£. Here the following coercivity assumption turns out to be useful. 

Assumption 3.2. For any n, there exists a nonnegative ^""-predictable 
function L n (t) such that 

b ni (t, x n )xf + trace a n (t, x n ) < L n (t)(l + \x?\ 2 ) (3.4) 
for almost all (ui,t). Furthermore, for any T G [0, oo), 

lim lim n ^ 00 P n ( / LJt) dt > c) = 0. 
c->oo J Q 

Remark 3.2. Quite often one imposes a linear growth assumptions on the 
coefficients a n and b n , which of course implies (|3.4j). However, say in one 
dimension, if a n = and b m (t,y.) = b n (t,y t ) and b n (t,y t ) > for y t < 
and b n (t,yt) < for yt > 0, then ( |3.4j ) is satisfied with L = 0. Therefore 
generally ( |3.4[ ) does not provide any control on the behavior of \b n (t, yt)\ for 
large \y t \. 
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For that reason, Theorem 3.2 below does not follow from the results of 
1 and 



lim li mrwoo P n (K| > N) = (3.5) 

N— >oo 



Theorem 3.2. Let 



and let Assumptions ]2^, \3.lj and |3jJ be satisfied. Then the sequence 



(Q n )n>i is precompact. Furthermore, let k be an integer and f n (t,x) be 
Borel -valued functions defined on (0, oo) x M. d such that \f n (t, x)\ < 
L(\x\,t) for all t,x,n. Define 

y?= f f n (s,x™)ds. 



o 



Then the sequence of distributions of(x n ,y n ) on T>([0, oo), ]R d+fc ) is precom- 
pact as well. 



Proof. We are going to use a method introduced in Sec. 4, Ch. II of [11]. 
Define 

*t =xt~ 3?, Mt) = exp ( - 2 J L n (s) ds) , u n (t, x) = (1 + \x\ 2 )<f> n (t). 

Also as in the proof of Lemma ^L^, use notation fl2.£| ) and (|2.9| ) and notice 
that due to special choice of u, we have R"t(z.) = 0. 
Then by using Ito's formula, we get that the process 

M?:= (1 + K| 2 )^)-(1 + K| 2 ) 

- f [2zfb ni (s,x n ) + 2 trace a n (s,x n ) - 2L n {s){\ + |z s n | 2 )] n (s) ds (3.6) 

is a local martingale. 

Now take r f l again from fl2.4|) and remember that z™ = x™ for s < 7™, so 
that the expression in the brackets in (l3.6|) is negative due to Assumption 



3.2[ Then we see that 

fl? := (1 + kVI 2 )^(* A 7 ") - (1 + K| 2 ) 

is a local supermartingale. For any constant A > 0, the process H™I\ x n\ <N 
also is a local supermartingale and, since it is bounded from below by the 
constant —(1 + A 2 ), it is a supermartingale. Therefore, upon defining 

K n r = inf{t > : sup \x n s \ > r}, r™ = 7™ A «?, 

we get that, for any T E [0, 00), 

£"(1 + |^ AT „| 2 )<MT A t?)I\ x »\< n <l + N 2 , 

E n {l + \z?n\ 2 )MT?)IW\<N,T r n<T< r < 1 + A 2 . 
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Then we notice that on the interval [0, 7") the process jf is identically zero. 
Hence, for r ™ < T < 7™ we have 



and we obtain 



e~ c (l+r 2 )P n Q L n (t)dt<c,\x%\ < N,t? <T < -yA < 1 + N 2 , 
lim Hm^P^KI < AT,T r " < T < 7 n ) = 0. 



This holds for any iV and along with assumption (|3.q ) and Remark 2J5 
leads first to to 

lim lim"™^^™ <T< 7'™) = 



and then to (|3.2| ). 



Finally, observe that (3.1) is obviously satisfied even if < t < k™ rather 



than < t < t~ . Hence, by referring to Lemma 3.1 we finish proving the 



assertion of our theorem regarding the distributions of x n . 



Lemma 3T yields the result for (x n ,y n ) as well since, obviously, for < 
t < r A r r n , we have 

|y t n | < f L(r,s)ds. 
Jo 

The theorem is proved. 

4. An example of queueing model 

We consider a particular queueing system with d service stations and 
d + 1 incoming streams of customers. We refer the reader to [g] for relations 
of this system to practical problems. The first d streams are composed of 
customers "having appointments", meaning that the customers from the 
ith stream only go to the ith service station. The last stream, to which we 
assign number 0, is the one of "free" customers who, upon "checking in", are 
routed to the service stations according to certain rule to be described later. 
We assume that each service station consists of infinitely many servers, so 
that infinitely many customers can be served at each station simultaneously. 
Denote by Q\ the number of customers being served at the ith station at 
time t. 

With station i, i = 1, d, we associate a "cost" on > and suppose that 
a "free" customer arriving at time t is directed to the ith station if i is the 
smallest integer satisfying 

otiQ\_ < OLjQ° t _ for all j ^ i. 

Such a routing policy is called load-balancing in Q. Here and below in this 
section the summation convention over repeated indices is not enforced. 
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We take some numbers Ao, > and assume that the ith stream of 
customers forms a Poisson process with parameter Aj. To describe the ser- 
vice times we fix some "thresholds" N 1 , ...,N d , which are positive integers, 
and assume that, given < Q\ < N l , each of Q\ customers at the ith station 

(i) has its own server, 

(ii) spends with its server a random time having exponential distribution 
with parameter 1, 

(hi) after having been served leaves the system. 

However, given Q\ > N % , the service is organized differently. All Q\ 
customers are divided into disjoint groups each consisting of two persons 
apart from at most one group having only one member. Then each of those 
groups is supposed to get service according to the rules (i)-(iii) above. By 
the way, it is not hard to understand that on average both discipline of 
servicing yield the same number of customers having been served during 
one unit of time. 

Finally, we assume that all service times and arrival processes are as 
independent as they can be. 

Now we describe the model in rigorous terms. For any numbers y 1 , ...,y d 
define 

argmin y k = i 

k=l,...,d 

if i is the least of l,...,d such that y % < y k for k 7^ i. For x £ M. d and 
i = 1, d, let 

{1 if i = argmin otkX h , 
k=i,...,d 
otherwise. 

Take independent Poisson processes 11°,..., 11^ with parameters Ao,...,A^, 
respectively. Then we think of the number of arrivals at the ith. station as 
given by 

A\ = f <f(Q s _) dIL° s + lT t , 
J 

where Q s = {Ql,...,Q d ) and Q\ are some integer-valued right continuous 
processes having left limits. To model the number of departures D\ from 
the ith station up to time t we take Poisson processes IT^ and A 1 / , i = 1, d, 
j = 1, 2, having parameter 1 and mutually independent and independent 
of (IT , ...,U d ). Then we define 

j>i 

+ / I N i <Q\_ X] (^Qi_>2j + -^_+1>2j) d K j - 
j>l 

To be consistent with the description, Q t should satisfy the balance equa- 
tions Ql = Qq + A\ — D\. Thus, we are going to investigate the system of 
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equations 

dQl = FiQt-) dU° t + dU\ - I Ni>Qt _ I Q i_>j dUl 1 

i>i 

~^N i <Q\_ (^Qi_>2j + ^Ql_+i>2j) d&t i = 1, ••) d- (4.1) 

Needless to say that we assume that all the Poisson processes we are dealing 
with are given on a probability basis satisfying the "usual" assumptions. 
We also assume that the initial condition Q$ is independent of the Poisson 
processes. 

Notice that for any initial condition Qq there is a unique solution of (4.1). 
Indeed obviously, for any solution we have Q\ < Qq + TL® + n|, so that, while 
solving (4.1) for t £ [0,T], one can safely replace the infinite sums in ( |4.1| ) 
with the sums over j < Qq + Ily + 11^. After that one solves (4.1) on each 
uj noticing that between the jumps of the Poisson processes Qt is constant 
and the jumps of Qt themselves are given by (4.1). 



For obvious reasons we rewrite (4.1) in terms of representation (2.1). 
First, for k = 0, d, i = 1, ~.,d,j > 1, we define 

These processes are square integrable martingales with 

(n k ) t = x k t, (n.% = t, (A% = t. 

Next, for i = 1, ...,d, define 

mi = f 5\q s j) du° s + ni - f i Ni>Qi y, ^ >j 

Jo Jo 3 

ft 

which are at least locally square integrable martingales. Then after observ- 
ing that, for any integer q > 0, 

^2 J 5>i = ?> 5J(Jg>2j + Iq+l>2j) = Q, 



we turn equation (4.1) into the equation 

dQl = (\ 5\Qt) + Xi - Qt) dt + dMl (4.2) 

In order to explain what follows (in no way is this explanation used in 
the proof of Theorem |4.l| below) , notice that (|4.2| ) seems to imply that 

{EQ\)' = X ES\Q t ) + Aj - EQ\. (4.3) 

We are interested in the behavior of Qt when Aj's are large but Ao is much 
smaller than Ai,..., A<j. Then, on the one hand, EQ\ should be large for 
moderate t and, on the other hand, the first term on the right in ( [4.3| ) can 
be neglected. In that situation equation ( |4.3| ) turns out to have a stable point 
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EQ\ = Aj. This means that, if for the initial condition we have EQ l = A,,, 
then EQ\ = \ for all t. Notice that since Aj's are large, so should be EQ l . 

Therefore, we write A; = \ + AAj, where AAj will be assumed to have 
order of Ao, denote 

Ql = Ql ~ ^ 

and rewrite ( [L2| ) in terms of Qt- At this moment we introduce the assump- 
tion that 

Ajtti = n, i = l,...,d, (4.4) 

with n being an integer (independent of i) to be sent to infinity. This is 
convenient due to the simple fact that then 

5\x) = 5\x - A). 

In this notation (|4.2| ) becomes 

dQ\ = (\oS\Qt) + A A, - Q\) dt + dM\. 

To understand what kind of normalization is natural we compute the qua- 
dratic characteristics of M\. Notice that, for any integer q > 0, we have 

y^Ug>2j + Iq+l>2j) 2 = y~](Jg>2j + 2Iq>2j + Iq+l>2j) 

= 3[g/2] + [(g + l)/2] =: qf(q), 

where [a] is the integer part of a. By the way, we can only define f(q) by 
the above formula for all real q > 0. If q < 0, we let f(q) = 0. Then 

< / < 2, lim f(q) = 2. (4.5) 

It follows that 

d(M)f = [\o5\Qt) + X l + QlI Qi<Ni + Qif(Qt)I Qi > Ni ] dt. 

Also due to independence of our Poisson processes and the fact that 5 l 5 J = 
for i j, we get 

(M)l j =0 for i ^ j. 

If we believe that, in a sense, Q\ ~ Aj, then Q\j\ should converge as well 
as Ml/y/Xi, and we see that it is natural to expect Qt/wXi to converge to 
certain limit. To make the model more meaningful we also assume that the 
thresholds A^'s are large and roughly speaking proportional to A,. In this 
way we convince ourselves that the following result seems natural. 

Theorem 4.1. Let ai,...,ad > and fio, /i^ > and v\,...,v& G E be 
fixed 'parameters. For n = 1,2, ... define 

A, = na^ 1 + fiiVn, i = 1, d, A = ^Vn, 
A 1 = na" 1 + fj-y/n, i = 1, d. 
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Let Qt = Q™ be the solution of ((A) with certain initial condition indepen- 
dent of the Poisson processes and introduce 



n 



-1/2 



(Q 



Hi 
t 



na 



l 

l > 



Q 



nd 
t 



na„ 



Let Q n be the distribution of x n on T> . Finally, assume that the distribution 
of Xq weakly converges to a distribution Fq as n — > oo . 

Then, as n — > oo, Q n converges weakly to the distribution of a solution of 
the following system 

dx\ = (^(xt) + ^ - x\) dt + a~ 1/2 (2 + L xl ^ 2 dw\, i = 1, d 

(4.6) 

considered on some probability space with wt being a d-dimensional Wiener 
process and Xq distributed according to Fq. 



Proof. First of all notice that (|4.q ) has solutions on appropriate prob- 
ability spaces and any solution has the same distribution on the space of 
M. d - valued continuous functions. This follows from the fact that an obvious 
change of probability measure allows us to consider the case with no drift 
terms in (|4.6| ). In that case ( [4.6| ) becomes just a collection of unrelated one- 
dimensional equations with uniformly nondegenerate and bounded diffusion. 
Weak unique solvability of such equations is a very well known fact (see, for 
instance, Theorems 2 and 3 of Q). 



In the proof of convergence we will be using Theorems 3.2 and 2.1. Ob- 
serve that Assumption is satisfied since xf 1 has no jumps bigger than 
2 n -i/2 anc [ j/™((o, oo) x B^) = if n > Ad/a 2 . Furthermore, if in the ar- 
gument before the theorem we take Aj = na^ 1 , so that (4.4) holds, and let 
AAj = \ii\fn, then after noticing that, by definition, 



Q r 



n 



1/2 ™m 



xT + na- 



we easily obtain 



dx n t 



b n {x n t )dt + dm n t 
where 

b ni (x) = f i 5 i (x) + fi l -x i , a nij 
+ (x l n 



(m n ) 



a n {x n s )ds, 



(4.7) 



( x ) = #3{ n - 1 l 2 ii #{x) + a" 1 + W n~ 1/2 



- 1/2 + a- 1 : 



[I x i <v i + f{n 1/2 x l + na i l )I x i> v i\). 
Upon remembering ( |4.5| ) we see that, for a constant N and all n and x, 
we have |6 n (x)| + trace a n (x) < N(l + |x|), which shows that Assumptions 
2.1 and 3.1, equivalent in our present situation, and Assumption 3.2 are 
satisfied. By Theorem 3.2 the sequence (Q n ) is precompact. 



Next, obviously Assumption 2.2 is satisfied if we take 



G = {(t,x) : t > 0, Yl fax* - a j x j )(x i - = 0}, 
«,i=i 

b l (x) = hq5\x) +fH- x\ a l3 {x) = 5 l3 a^ l {l + I x i <v i + 2I x i> v i). 
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Finally, Assumption O is satisfied since det a n {x) > a^ 1 ■ ... ■ a^ 1 every- 
where. 

By Theorem 2.1 every convergent subsequence of (Q n ) converges to the 
distribution of a solution of ( |4.6| ) with the above specified initial distribution. 
Since all such solutions have the same distribution, the whole sequence (Q n ) 
converges to the distribution of any solution of Q4,6|). The theorem is proved. 



Remark 4.1. In Theorem 4.1 we assume that Q™ g° es to infinity with cer- 
tain rate, namely Q™ ~ raa" 1 . Interestingly enough, if we change the rate, 
the diffusion approximation changes. Indeed, keep all the assumption of 
Theorem iA apart from the assumption that Xq converges in distribution 
and instead assume that, for a 7 € [0, 00) say for 7 = 0, 



11 



- 1/2 (Q£ 1 -n 7 arV-->Qo d -'W 



converges in law to a random vector. Notice that the case 7 = 1 is covered 
by Theorem 4.1. We claim that, for 7 > 1, the processes 

V? = n-^iQf - nq t a^ 1 , Q? - nq t af), 

where qt = 1 + (7 — l)e - ', weakly converge to a solution of the system 

dy\ = (jm&iyt) vl) dt + a; 1/2 (l + q t ) 1/2 dw\, i = 1, d, 

and for 7 £ [0, 1) weakly converge to a solution of 

dy\ = (jMiPiVt) + fM ~ y\) dt + a~ 1/2 (l + 2ft) 1 / 2 dw\, %= 1, .... d. 
Indeed, we have 

Q ni = n l/2 y ni + nqta -l^ dqt = (1 _ Qt) ^ 

dy n t = b n (y?) dt + dm?, (m n ) t = f a n (y?) ds, 

Jo 

where 

b ni {x) = 1*0$ (x) x\ a nij (x) = 5 ij (n-^fiod^x) + aj l 



1/2 + (x l n 1/2 + q tai x )+ [i( 7 _i) e -* <a4 (^_ a .i) n -i/a 
+/(n 1/2 x l +^a J " 1 )/ (7 _ 1)e _i> Qi(l/l _ ;cl)n _i/2]^. 



As in the proof of Theorem |4.1| one checks that the sequence of distribu- 
tions of y n is precompact. Furthermore, obviously, for any x 



a nij {x) 



S^a-\l + q t ) if 7 <1, 
5^aT l (l + 2q t ) if 7 >1, 



and this yields our claim in the same way as in the proof of Theorem 4.1 
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Remark 4.2. We tried to explain before the proof of Theorem 4.1 why its 



statement looks natural. Now we can also explain how the function qt from 



Remark 4.1 was found. The explanations is based on a kind of law of 
large numbers which in queueing theory is associated with so-called "fluid 
approximations". Generally, "fluid approximations" can also be derived 
from Theorems 3.2 and 2.1. For instance, if A^ = Afc(n) and \f.(n)/n — > /3k 



as n — > oo, and /3q = 0, then under the condition that Q^/n converges in 
probability to a constant vector, the processes Qf/n converge in probability 
uniformly on each finite time interval to the deterministic solution of the 
system 

dqj = (Pi - q l ) dt, i = l,...,d. 

This fact obviously follows from Theorems [T^ and 2A applied to (|4.2| ) 
written in terms of z™ := Q^/n: 

dzf = b ni {z?)dt + dMf, 

with d(M n )i j = a%(z?)dt, 

b ni ( x ) = 6 i (x)X /n + Xi/n - x, \af j (x)\ < iVn _1 (l + |x|), 

where the constant N is independent of x,n,t. 

The following observation can be generalized so as to be used in various 
control problems in which optimal controls are discontinuous with respect 
to space variables. 

Remark 4.3. It turns out that many discontinuous functionals of x n con- 
verge in law to corresponding functionals of x.. For instance take a Borel 
vector- valued function f(x) on M. d such that the set of its discontinuities lies 
in a closed set J C M rf having Lebesgue measure zero. Also assume that / 
is locally bounded, that is bounded on any ball in M. d but may behave in 
any way at infinity. As an example, one can take f(x) = (5 1 (x), ...,S d (x)). 
Then, for 

Vt-= f f«)ds, y t := f f(x s )ds 
Jo Jo 

we have that the distributions of (x n , y n ) converge weakly to the distribution 

of (x.,y.). 

Indeed, append (|4.7| ) with one more equation: dyf = f(xf) dt and con- 
sider the couple z n = (x n ,y n ) as a process in Obviously Assumptions 



2.1| and Assumptions |2.4| are satisfied for thus obtained couple. 
Furthermore, define 



d 

H = {(t,x,y) : t > 0,y G R, x£j or J (ojX 4 - ctjX j )(x l - Vi) = 0}. 

Since J is closed, for any t > and (x, y) H t , the function / (independent 
of y) is continuous in a neighborhood of x, which along with the argument 
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in the proof of Theorem 4.1 shows that Assumption 2.2 is satisfied for z™. 
Finally, for 

H rn = H, d m = d, v mi (t,x,y) = x\ i = l,...,d, 

we have 



v m, 



(H t ) = {x : x G J or J (c^x 1 - ajx j )(x l - v£) = 0} 

which has d-dimensional Lebesgue measure zero and 

det V nm (t,x n ,y n ) = det a n (x") > af 1 • ... • af > 0. 
Hence Assumption [2.5| is satisfied as well. This along with precompactness 



of distributions of (x n ,y n ) guaranteed by Theorem 3.2 and along with The 



orem 2J5 shows that any convergent subsequence of distributions of (x n , y n ) 
converges to the distribution of a process (x.,y.), whose first component 
satisfies fl4.6|) and the second one obeys dyt = f(xt)dt. 

Thus, we get our assertion for a subsequence instead of the whole se- 
quence. However, as we have noticed above, solutions of Q4.6D are weakly 
unique and this obviously implies that solutions of the system ( [4.6[ ) ap- 
pended with dyt = f(xt)dt are also weakly unique. Therefore, the whole 
sequence of distributions of (x n ,y n ) converges. 

5. An L p ESTIMATE 

Let d > 1 be an integer, (Q, J 7 , P) be a complete probability space, and 
(3~ut ^ 0) be an increasing filtration of cr-fields T± C T with Tq being 
complete with respect to P, T. Let K(r,t) and L(r,t) be two nonnegative 
deterministic function defined for r, t > 0. Assume that they increase in r 
and are locally integrable in i, so that 

[ (K(r, t) + L(r,t))dt < oo Vr,T g(0,oo). 
J o 

Let S(t, x) be a nonnegative deterministic function defined for t > and 
x £ M. d and satisfying 5(t,x) < K(\x\,t). Define A(t,x) as the set of all 
symmetric nonnegative d x (i-matrices a such that 

5(t,x)\X\ 2 < ji^X < K(\x\,t)\X\ 2 VA E R d . 

Here, as well as everywhere in the article apart from Section we use the 
summation convention. For any symmetric d x ti-matrix v = (vij) define 

F(t,x,v) = sup a^Vij. 

a£A(t,x) 

As is easy to see, if \i(v), i = 1, d, are eigenvalues of v numbered in any 
order, then 



F(t, x ,v) = x, \i{v)), 



i=i 
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where x(i, x, A) = -K" (|x| , i) A for A > and x(t, x > A) = 8(t, x)X for A < 0. 

Remember that C^°(M. d+1 ) is the set of all infinitely differentiable real- 
valued function u = u(t, x) on R rf+1 with compact support. 

Theorem 5.1. Let x t , t > 0, 6e an M. d -valued Tt-adapted continuous pro- 
cess such that, for any u E Cg°(R <i+1 ), i/ie following process is a local Tt- 
supermartingale: 

u(t,x t )- / [u s (s,x s ) + F(s,x s ,u xx (s,x s )) + L(x*,s)\u x (s,x s )\] ds, 
Jo 

(5.1) 

where u x is the gradient of u with respect to the matrix of second- 

order derivatives u x t x j of u, 

u s = du/ds, u x i x j = d 2 ujdx l dx? . 

Then for any r,T S (0, oo) there exists a constant N < oo, depending only 
on r,L(r,T), and d (but not on K(r,t)) ; such that, for any nonnegative 
Borel f(t,x), we have 

r TAr r 

E / 8 d ^ d+l \t,x t )f(t,x t )dt < Ar||/|| Ld+l([ o,T]x BT .), (5-2) 
J o 

where 

ii/ik^ao^) = ( r / / d+i (*^)^) i/(d+i) . 

JO J\x\<r 

B T is the open ball in M. d of radius r centered at the origin, and r r is the 
first exit time of xt from B r . 

Proof. First of all notice that for any u G C X) (M. d+1 ) expression (l5.l| ) 
makes sense. Indeed, if r is such that u(t, x) = for \x\ > r and all t, then 
the integrand is bounded by a constant times 

f [1 + K(r,s) + L(x* t +r,s)]ds, 
Jo 

which is finite since each trajectory of x s is bounded on [0,t\. Also observe 
that usual approximation techniques allows us to only concentrate on the 
case of infinitely differentiable functions / > vanishing for |x| > r for some 
r. We fix r, such a function /, and a nonnegative function ( G Cg°(IR rf+1 ) 
with unit integral and support in the unit ball of M. d+1 centered at the origin. 
Below, for any locally bounded Borel function g(t, y) and e > we use the 
notation 

9 {e) =9*(e, where a;) = e" d_1 C(t/e, x/e). 

Next, we need Theorem 2 of 0, which states the following. There exist 
constants a = a(d) > and N r = N(r, d) < oo and there exists a bounded 
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Borel nonpositive function z on R d+1 which is convex on i?2r for each fixed 
t and is such that, for each nonnegative symmetric matrix a, 

a(det o) 1 /^ 1 )/^) < 4 e) + a ij z { Xi for e ^ r ' * G M ' N ^ r ' ( 5 - 3 ) 

|4 e) | < 2r _1 |z (£) l for e < r/2,t G R, |x| < r, (5.4) 

N <Ar r ||/|| id+l(RxBr) in Rx5 2r . (5.5) 

Notice that in Theorem 2 of pj there is the minus sign in front of z^ . 



However, ( |5.3| ) is true as is, since one can replace t with — t and this does not 
affect any other term. Observe that ([5l]) obviously implies that for e < r, 
we have 

k^l^iVrll/IU^CKxfl,.) m RxB r . (5.6) 
Fix an e > 0. We claim that the process 

rtATr 

Zf-=-z {£ HtAT r ,x tATr )- [-z^(s,x s ) (5.7) 

Jo 

+F(s,x s ,-z&(s,x s )) + L(r,s)\zi e) (s,x s )\] ds 

is a local supermartingale. To prove the claim it suffices to prove that 
( |5.7[) is a local supermartingale on [0, T] for every T G [0, oo). Fix a T G 
[0, oo) and concentrate on i G [0, T]. Change — z^ outside of [0, T] x B r in 
any way with the only requirement that the new function, say u belong to 
Co°(R d+1 ). Then the process ( |5.lD is a local supermartingale. Replacing t 
with t Ar r yields a local supermartingale again. Also observe that subtracting 
an increasing continuous process from a local supermartingale preserves the 
property of being a local supermartingale. After noticing that for < s < 
t A r r < T, we have |x s | < r and L(x*, s) < L(r, s) and we conclude that 

1-tA.Tr 

Tft:=u(tAT r ,X tA Tr)- [-4 £ H S i X s) 

Jo 

+F(s, x s , -4 £ 2 (s, x»)) + s)|4 £) (s, x s )|] ds 
is a local supermartingale on [0, T]. Since 

Vt~£,t = [u(0,Xq) - z ie \0,x )]I Tr=0 , 

is a bounded martingale, (|5.7|) is a local supermartingale indeed. 

After having proved our claim we notice that for each T G [0, oo) the 
process (5/7) is obviously bounded on [0,T]. Therefore (p/7| ) is a super- 
martingale and 

^T/ Tr >0 < Wr r >0 < Slip |Z (£) (0,X)|, 

|x|<r 



which along with ( |5.6[ ), (|5.4|), and the fact that z < 0, yields that for any 
e < r/2 

l-T/\T r 

E / [zi £) (s,x s ) -F(s,x s ,-z&(s,x s ))] ds 
Jo 
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<N r \\f\\ Ld+MxBr) {l + 2r- l E / L(r,s)ds). 

Jo 

Here, owing to (|5.3| ), 



- F(s, x, -z£>) = inf \z® + a ij z {E ) ,1 
> f^a inf (deta) 1 /^ 1 ) = /( £ W d /( d+1 ). 

a£A(s,x) 

rtAr r 

E / <^ +1 V (£) (^)^ < iV||/|| Ld+l([0 , r]xBr) 
J o 

Ar = Ar rQ - 1 (i + 2r~ 1 / L(r,s)ds). 

Jo 

Finally we let e J. and use the continuity of / which guarantees that 
/(e) _> /. Then upon remembering that / > and using Fatou's theorem, 
we arrive at (|5.2| ) with the above specified N. The theorem is proved. 



Hence 



with 



Remark 5.1. Actually, we did not use the continuity of xt- We could have 
only assumed that xt is a separable measurable process. However, then it 
turns out that the assumption about the processes (5.1) implies that xt is 
continuous anyway and moreover that xt is an Ito process (see ]l0|). 
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